Overview

Brought to you by YData

Dataset statistics

Number of variables18
Number of observations377185
Missing cells1869153
Missing cells (%)27.5%
Duplicate rows49
Duplicate rows (%)< 0.1%
Total size in memory51.8 MiB
Average record size in memory144.0 B

Variable types

Text15
Boolean2
Categorical1

Alerts

private pool has constant value "True" Constant
PrivatePool has constant value "True" Constant
Dataset has 49 (< 0.1%) duplicate rowsDuplicates
status has 39918 (10.6%) missing values Missing
private pool has 373004 (98.9%) missing values Missing
propertyType has 34733 (9.2%) missing values Missing
baths has 106338 (28.2%) missing values Missing
fireplace has 274071 (72.7%) missing values Missing
sqft has 40577 (10.8%) missing values Missing
beds has 91282 (24.2%) missing values Missing
stories has 150716 (40.0%) missing values Missing
mls-id has 352243 (93.4%) missing values Missing
PrivatePool has 336874 (89.3%) missing values Missing
MlsId has 66880 (17.7%) missing values Missing

Reproduction

Analysis started2024-11-26 09:56:20.878083
Analysis finished2024-11-26 09:57:44.117908
Duration1 minute and 23.24 seconds
Software versionydata-profiling vv4.12.0
Download configurationconfig.json

Variables

status
Text

Missing 

Distinct159
Distinct (%)< 0.1%
Missing39918
Missing (%)10.6%
Memory size2.9 MiB
2024-11-26T12:57:44.453379image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length38
Median length8
Mean length7.8409183
Min length1

Characters and Unicode

Total characters2644483
Distinct characters62
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)< 0.1%

Sample

1st rowActive
2nd rowfor sale
3rd rowfor sale
4th rowfor sale
5th rowfor sale
ValueCountFrequency (%)
for 199983
35.8%
sale 199634
35.8%
active 106540
19.1%
foreclosure 6771
 
1.2%
new 6165
 
1.1%
construction 5475
 
1.0%
pending 5364
 
1.0%
contract 3802
 
0.7%
pre-foreclosure 3679
 
0.7%
under 3661
 
0.7%
Other values (125) 17013
 
3.0%
2024-11-26T12:57:44.889130image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 351490
13.3%
o 244796
9.3%
r 239503
9.1%
223406
8.4%
s 217112
8.2%
l 211183
8.0%
a 207546
7.8%
f 166776
 
6.3%
c 137402
 
5.2%
t 132032
 
5.0%
Other values (52) 513237
19.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2230711
84.4%
Space Separator 223406
 
8.4%
Uppercase Letter 183616
 
6.9%
Dash Punctuation 3764
 
0.1%
Other Punctuation 2762
 
0.1%
Decimal Number 203
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Close Punctuation 7
 
< 0.1%
Math Symbol 5
 
< 0.1%
Currency Symbol 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 351490
15.8%
o 244796
11.0%
r 239503
10.7%
s 217112
9.7%
l 211183
9.5%
a 207546
9.3%
f 166776
7.5%
c 137402
 
6.2%
t 132032
 
5.9%
i 124434
 
5.6%
Other values (14) 198437
8.9%
Uppercase Letter
ValueCountFrequency (%)
A 107839
58.7%
F 44536
24.3%
P 11133
 
6.1%
N 6212
 
3.4%
C 5915
 
3.2%
U 3674
 
2.0%
S 2412
 
1.3%
B 539
 
0.3%
T 311
 
0.2%
I 302
 
0.2%
Other values (8) 743
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 44
21.7%
1 38
18.7%
3 19
9.4%
9 18
8.9%
0 18
8.9%
4 18
8.9%
8 13
 
6.4%
5 13
 
6.4%
7 11
 
5.4%
6 11
 
5.4%
Other Punctuation
ValueCountFrequency (%)
/ 2535
91.8%
. 112
 
4.1%
: 112
 
4.1%
, 3
 
0.1%
Space Separator
ValueCountFrequency (%)
223406
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3764
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2414327
91.3%
Common 230156
 
8.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 351490
14.6%
o 244796
10.1%
r 239503
9.9%
s 217112
9.0%
l 211183
8.7%
a 207546
8.6%
f 166776
6.9%
c 137402
 
5.7%
t 132032
 
5.5%
i 124434
 
5.2%
Other values (32) 382053
15.8%
Common
ValueCountFrequency (%)
223406
97.1%
- 3764
 
1.6%
/ 2535
 
1.1%
. 112
 
< 0.1%
: 112
 
< 0.1%
2 44
 
< 0.1%
1 38
 
< 0.1%
3 19
 
< 0.1%
9 18
 
< 0.1%
0 18
 
< 0.1%
Other values (10) 90
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2644483
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 351490
13.3%
o 244796
9.3%
r 239503
9.1%
223406
8.4%
s 217112
8.2%
l 211183
8.0%
a 207546
7.8%
f 166776
 
6.3%
c 137402
 
5.2%
t 132032
 
5.0%
Other values (52) 513237
19.4%

private pool
Boolean

Constant  Missing 

Distinct1
Distinct (%)< 0.1%
Missing373004
Missing (%)98.9%
Memory size736.8 KiB
True
 
4181
(Missing)
373004 
ValueCountFrequency (%)
True 4181
 
1.1%
(Missing) 373004
98.9%
2024-11-26T12:57:45.048527image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

propertyType
Text

Missing 

Distinct1280
Distinct (%)0.4%
Missing34733
Missing (%)9.2%
Memory size2.9 MiB
2024-11-26T12:57:45.526320image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length129
Median length112
Mean length13.522067
Min length1

Characters and Unicode

Total characters4630659
Distinct characters68
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique615 ?
Unique (%)0.2%

Sample

1st rowSingle Family Home
2nd rowsingle-family home
3rd rowsingle-family home
4th rowsingle-family home
5th rowlot/land
ValueCountFrequency (%)
home 126898
21.0%
single 98013
16.2%
family 97391
16.1%
single-family 92206
15.2%
condo 42532
 
7.0%
lot/land 20552
 
3.4%
townhouse 18579
 
3.1%
land 10939
 
1.8%
traditional 9679
 
1.6%
multi-family 9424
 
1.6%
Other values (277) 79448
13.1%
2024-11-26T12:57:46.132869image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 465212
 
10.0%
i 441168
 
9.5%
e 398458
 
8.6%
o 376556
 
8.1%
m 361450
 
7.8%
n 336503
 
7.3%
a 283147
 
6.1%
263457
 
5.7%
y 210598
 
4.5%
g 193479
 
4.2%
Other values (58) 1300631
28.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3804148
82.2%
Uppercase Letter 383104
 
8.3%
Space Separator 263457
 
5.7%
Dash Punctuation 110549
 
2.4%
Other Punctuation 62907
 
1.4%
Decimal Number 4641
 
0.1%
Open Punctuation 805
 
< 0.1%
Close Punctuation 805
 
< 0.1%
Math Symbol 243
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 107851
28.2%
F 100978
26.4%
H 44930
11.7%
C 41930
 
10.9%
T 27723
 
7.2%
R 13823
 
3.6%
L 12233
 
3.2%
M 12047
 
3.1%
O 11386
 
3.0%
D 6101
 
1.6%
Other values (15) 4102
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
l 465212
12.2%
i 441168
11.6%
e 398458
10.5%
o 376556
9.9%
m 361450
9.5%
n 336503
8.8%
a 283147
7.4%
y 210598
 
5.5%
g 193479
 
5.1%
h 133059
 
3.5%
Other values (14) 604518
15.9%
Decimal Number
ValueCountFrequency (%)
1 2284
49.2%
2 2027
43.7%
3 161
 
3.5%
4 85
 
1.8%
8 38
 
0.8%
7 15
 
0.3%
0 14
 
0.3%
5 11
 
0.2%
9 6
 
0.1%
Other Punctuation
ValueCountFrequency (%)
/ 51353
81.6%
, 11522
 
18.3%
& 31
 
< 0.1%
. 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 148
60.9%
< 95
39.1%
Space Separator
ValueCountFrequency (%)
263457
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 110549
100.0%
Open Punctuation
ValueCountFrequency (%)
( 805
100.0%
Close Punctuation
ValueCountFrequency (%)
) 805
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4187252
90.4%
Common 443407
 
9.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 465212
11.1%
i 441168
10.5%
e 398458
9.5%
o 376556
 
9.0%
m 361450
 
8.6%
n 336503
 
8.0%
a 283147
 
6.8%
y 210598
 
5.0%
g 193479
 
4.6%
h 133059
 
3.2%
Other values (39) 987622
23.6%
Common
ValueCountFrequency (%)
263457
59.4%
- 110549
24.9%
/ 51353
 
11.6%
, 11522
 
2.6%
1 2284
 
0.5%
2 2027
 
0.5%
( 805
 
0.2%
) 805
 
0.2%
3 161
 
< 0.1%
+ 148
 
< 0.1%
Other values (9) 296
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4630659
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 465212
 
10.0%
i 441168
 
9.5%
e 398458
 
8.6%
o 376556
 
8.1%
m 361450
 
7.8%
n 336503
 
7.3%
a 283147
 
6.1%
263457
 
5.7%
y 210598
 
4.5%
g 193479
 
4.2%
Other values (58) 1300631
28.1%

street
Text

Distinct337076
Distinct (%)89.4%
Missing2
Missing (%)< 0.1%
Memory size2.9 MiB
2024-11-26T12:57:46.552549image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length96
Median length83
Mean length18.581755
Min length1

Characters and Unicode

Total characters7008722
Distinct characters87
Distinct categories12 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique302913 ?
Unique (%)80.3%

Sample

1st row240 Heather Ln
2nd row12911 E Heroy Ave
3rd row2005 Westridge Rd
4th row4311 Livingston Ave
5th row1524 Kiscoe St
ValueCountFrequency (%)
st 83457
 
5.7%
dr 64519
 
4.4%
ave 62480
 
4.2%
rd 32631
 
2.2%
ln 23003
 
1.6%
n 18922
 
1.3%
w 18434
 
1.3%
ct 17819
 
1.2%
s 17722
 
1.2%
sw 15625
 
1.1%
Other values (69379) 1116671
75.9%
2024-11-26T12:57:47.115673image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1155250
 
16.5%
e 358127
 
5.1%
1 338548
 
4.8%
r 287515
 
4.1%
t 285628
 
4.1%
a 269227
 
3.8%
n 248642
 
3.5%
0 232055
 
3.3%
2 222173
 
3.2%
l 209098
 
3.0%
Other values (77) 3402459
48.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2965336
42.3%
Decimal Number 1725342
24.6%
Space Separator 1155250
 
16.5%
Uppercase Letter 1085502
 
15.5%
Other Punctuation 70763
 
1.0%
Dash Punctuation 5110
 
0.1%
Close Punctuation 669
 
< 0.1%
Open Punctuation 666
 
< 0.1%
Math Symbol 73
 
< 0.1%
Currency Symbol 7
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 358127
12.1%
r 287515
9.7%
t 285628
9.6%
a 269227
9.1%
n 248642
 
8.4%
l 209098
 
7.1%
o 197816
 
6.7%
i 188590
 
6.4%
d 153033
 
5.2%
s 124868
 
4.2%
Other values (16) 642792
21.7%
Uppercase Letter
ValueCountFrequency (%)
S 168200
15.5%
A 103962
 
9.6%
D 83434
 
7.7%
C 80923
 
7.5%
W 79967
 
7.4%
R 62074
 
5.7%
P 61992
 
5.7%
N 58841
 
5.4%
L 58075
 
5.4%
B 57938
 
5.3%
Other values (16) 270096
24.9%
Other Punctuation
ValueCountFrequency (%)
# 58707
83.0%
: 8052
 
11.4%
. 1285
 
1.8%
, 1132
 
1.6%
/ 829
 
1.2%
& 549
 
0.8%
' 164
 
0.2%
@ 17
 
< 0.1%
" 12
 
< 0.1%
* 8
 
< 0.1%
Other values (3) 8
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 338548
19.6%
0 232055
13.4%
2 222173
12.9%
3 178043
10.3%
5 155114
9.0%
4 154559
9.0%
6 121592
 
7.0%
7 114561
 
6.6%
8 109059
 
6.3%
9 99638
 
5.8%
Math Symbol
ValueCountFrequency (%)
| 33
45.2%
+ 33
45.2%
~ 4
 
5.5%
> 2
 
2.7%
< 1
 
1.4%
Space Separator
ValueCountFrequency (%)
1155250
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 669
100.0%
Open Punctuation
ValueCountFrequency (%)
( 666
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4050838
57.8%
Common 2957884
42.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 358127
 
8.8%
r 287515
 
7.1%
t 285628
 
7.1%
a 269227
 
6.6%
n 248642
 
6.1%
l 209098
 
5.2%
o 197816
 
4.9%
i 188590
 
4.7%
S 168200
 
4.2%
d 153033
 
3.8%
Other values (42) 1684962
41.6%
Common
ValueCountFrequency (%)
1155250
39.1%
1 338548
 
11.4%
0 232055
 
7.8%
2 222173
 
7.5%
3 178043
 
6.0%
5 155114
 
5.2%
4 154559
 
5.2%
6 121592
 
4.1%
7 114561
 
3.9%
8 109059
 
3.7%
Other values (25) 176930
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7008722
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1155250
 
16.5%
e 358127
 
5.1%
1 338548
 
4.8%
r 287515
 
4.1%
t 285628
 
4.1%
a 269227
 
3.8%
n 248642
 
3.5%
0 232055
 
3.3%
2 222173
 
3.2%
l 209098
 
3.0%
Other values (77) 3402459
48.5%

baths
Text

Missing 

Distinct229
Distinct (%)0.1%
Missing106338
Missing (%)28.2%
Memory size2.9 MiB
2024-11-26T12:57:47.301070image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length5.4117048
Min length1

Characters and Unicode

Total characters1465744
Distinct characters35
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)< 0.1%

Sample

1st row3.5
2nd row3 Baths
3rd row2 Baths
4th row8 Baths
5th row2
ValueCountFrequency (%)
baths 121289
28.7%
2 85147
20.2%
3 54127
12.8%
bathrooms 23281
 
5.5%
4 21450
 
5.1%
2.0 16576
 
3.9%
2.5 12892
 
3.1%
3.0 10869
 
2.6%
1 10579
 
2.5%
5 7666
 
1.8%
Other values (128) 58522
13.9%
2024-11-26T12:57:47.649614image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151753
10.4%
a 151195
10.3%
t 144772
9.9%
s 144570
9.9%
h 144570
9.9%
B 144054
9.8%
2 122744
8.4%
0 74485
 
5.1%
3 72973
 
5.0%
. 64765
 
4.4%
Other values (25) 249863
17.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 685579
46.8%
Decimal Number 378683
25.8%
Space Separator 151753
 
10.4%
Uppercase Letter 144460
 
9.9%
Other Punctuation 102517
 
7.0%
Math Symbol 1716
 
0.1%
Dash Punctuation 1036
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 151195
22.1%
t 144772
21.1%
s 144570
21.1%
h 144570
21.1%
o 46563
 
6.8%
m 23282
 
3.4%
r 23281
 
3.4%
b 7141
 
1.0%
q 202
 
< 0.1%
e 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 122744
32.4%
0 74485
19.7%
3 72973
19.3%
5 42488
 
11.2%
4 27979
 
7.4%
1 26418
 
7.0%
7 5244
 
1.4%
6 4567
 
1.2%
8 1227
 
0.3%
9 558
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
B 144054
99.7%
S 203
 
0.1%
F 202
 
0.1%
M 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 64765
63.2%
: 23281
 
22.7%
, 14394
 
14.0%
/ 77
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1030
99.4%
6
 
0.6%
Math Symbol
ValueCountFrequency (%)
+ 934
54.4%
~ 782
45.6%
Space Separator
ValueCountFrequency (%)
151753
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 830039
56.6%
Common 635705
43.4%

Most frequent character per script

Common
ValueCountFrequency (%)
151753
23.9%
2 122744
19.3%
0 74485
11.7%
3 72973
11.5%
. 64765
10.2%
5 42488
 
6.7%
4 27979
 
4.4%
1 26418
 
4.2%
: 23281
 
3.7%
, 14394
 
2.3%
Other values (9) 14425
 
2.3%
Latin
ValueCountFrequency (%)
a 151195
18.2%
t 144772
17.4%
s 144570
17.4%
h 144570
17.4%
B 144054
17.4%
o 46563
 
5.6%
m 23282
 
2.8%
r 23281
 
2.8%
b 7141
 
0.9%
S 203
 
< 0.1%
Other values (6) 408
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1465738
> 99.9%
Punctuation 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151753
10.4%
a 151195
10.3%
t 144772
9.9%
s 144570
9.9%
h 144570
9.9%
B 144054
9.8%
2 122744
8.4%
0 74485
 
5.1%
3 72973
 
5.0%
. 64765
 
4.4%
Other values (24) 249857
17.0%
Punctuation
ValueCountFrequency (%)
6
100.0%
Distinct321009
Distinct (%)85.1%
Missing0
Missing (%)0.0%
Memory size2.9 MiB
2024-11-26T12:57:48.083645image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length840
Median length605
Mean length374.50896
Min length334

Characters and Unicode

Total characters141259163
Distinct characters86
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique311038 ?
Unique (%)82.5%

Sample

1st row{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Central A/C, Heat Pump', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': None, 'factLabel': 'lotsize'}, {'factValue': '$144', 'factLabel': 'Price/sqft'}]}
2nd row{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '5828 sqft', 'factLabel': 'lotsize'}, {'factValue': '$159/sqft', 'factLabel': 'Price/sqft'}]}
3rd row{'atAGlanceFacts': [{'factValue': '1961', 'factLabel': 'Year built'}, {'factValue': '1967', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Attached Garage', 'factLabel': 'Parking'}, {'factValue': '8,626 sqft', 'factLabel': 'lotsize'}, {'factValue': '$965/sqft', 'factLabel': 'Price/sqft'}]}
4th row{'atAGlanceFacts': [{'factValue': '2006', 'factLabel': 'Year built'}, {'factValue': '2006', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Detached Garage', 'factLabel': 'Parking'}, {'factValue': '8,220 sqft', 'factLabel': 'lotsize'}, {'factValue': '$371/sqft', 'factLabel': 'Price/sqft'}]}
5th row{'atAGlanceFacts': [{'factValue': '', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '10,019 sqft', 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}
ValueCountFrequency (%)
factlabel 2640295
20.7%
factvalue 2640295
20.7%
757424
 
5.9%
year 754370
 
5.9%
cooling 390962
 
3.1%
heating 389267
 
3.1%
parking 383675
 
3.0%
price/sqft 377185
 
3.0%
lotsize 377185
 
3.0%
ataglancefacts 377185
 
3.0%
Other values (29472) 3668656
28.8%
2024-11-26T12:57:48.684677image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
' 21620955
15.3%
a 14183171
 
10.0%
12379318
 
8.8%
e 9771911
 
6.9%
t 8594876
 
6.1%
l 7533640
 
5.3%
c 6890716
 
4.9%
f 5960366
 
4.2%
: 5657791
 
4.0%
, 5126864
 
3.6%
Other values (76) 43539555
30.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 74422172
52.7%
Other Punctuation 33220252
23.5%
Space Separator 12379318
 
8.8%
Uppercase Letter 10165905
 
7.2%
Decimal Number 3896951
 
2.8%
Close Punctuation 3409932
 
2.4%
Open Punctuation 3409932
 
2.4%
Currency Symbol 311248
 
0.2%
Dash Punctuation 42947
 
< 0.1%
Math Symbol 505
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 14183171
19.1%
e 9771911
13.1%
t 8594876
11.5%
l 7533640
10.1%
c 6890716
9.3%
f 5960366
8.0%
u 3047834
 
4.1%
b 3024601
 
4.1%
i 2569220
 
3.5%
r 2434890
 
3.3%
Other values (16) 10410947
14.0%
Uppercase Letter
ValueCountFrequency (%)
V 2642336
26.0%
L 2642024
26.0%
P 774976
 
7.6%
C 671964
 
6.6%
A 608724
 
6.0%
F 608256
 
6.0%
G 515037
 
5.1%
H 436687
 
4.3%
R 388024
 
3.8%
Y 378210
 
3.7%
Other values (16) 499667
 
4.9%
Decimal Number
ValueCountFrequency (%)
1 758148
19.5%
0 606701
15.6%
9 559266
14.4%
2 486731
12.5%
5 270373
 
6.9%
8 253021
 
6.5%
7 247241
 
6.3%
6 245449
 
6.3%
4 235728
 
6.0%
3 234293
 
6.0%
Other Punctuation
ValueCountFrequency (%)
' 21620955
65.1%
: 5657791
 
17.0%
, 5126864
 
15.4%
/ 582199
 
1.8%
. 231556
 
0.7%
" 668
 
< 0.1%
% 176
 
< 0.1%
& 42
 
< 0.1%
; 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 494
97.8%
> 5
 
1.0%
~ 4
 
0.8%
< 2
 
0.4%
Close Punctuation
ValueCountFrequency (%)
} 3017480
88.5%
] 377185
 
11.1%
) 15267
 
0.4%
Open Punctuation
ValueCountFrequency (%)
{ 3017480
88.5%
[ 377185
 
11.1%
( 15267
 
0.4%
Dash Punctuation
ValueCountFrequency (%)
25251
58.8%
- 17696
41.2%
Space Separator
ValueCountFrequency (%)
12379318
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 311248
100.0%
Other Symbol
ValueCountFrequency (%)
® 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 84588077
59.9%
Common 56671086
40.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 14183171
16.8%
e 9771911
11.6%
t 8594876
10.2%
l 7533640
 
8.9%
c 6890716
 
8.1%
f 5960366
 
7.0%
u 3047834
 
3.6%
b 3024601
 
3.6%
V 2642336
 
3.1%
L 2642024
 
3.1%
Other values (42) 20296602
24.0%
Common
ValueCountFrequency (%)
' 21620955
38.2%
12379318
21.8%
: 5657791
 
10.0%
, 5126864
 
9.0%
} 3017480
 
5.3%
{ 3017480
 
5.3%
1 758148
 
1.3%
0 606701
 
1.1%
/ 582199
 
1.0%
9 559266
 
1.0%
Other values (24) 3344884
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 141233910
> 99.9%
Punctuation 25251
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
' 21620955
15.3%
a 14183171
 
10.0%
12379318
 
8.8%
e 9771911
 
6.9%
t 8594876
 
6.1%
l 7533640
 
5.3%
c 6890716
 
4.9%
f 5960366
 
4.2%
: 5657791
 
4.0%
, 5126864
 
3.6%
Other values (73) 43514302
30.8%
Punctuation
ValueCountFrequency (%)
25251
100.0%
None
ValueCountFrequency (%)
 1
50.0%
® 1
50.0%

fireplace
Text

Missing 

Distinct1652
Distinct (%)1.6%
Missing274071
Missing (%)72.7%
Memory size2.9 MiB
2024-11-26T12:57:49.016451image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length231
Median length3
Mean length5.0304614
Min length1

Characters and Unicode

Total characters518711
Distinct characters69
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1038 ?
Unique (%)1.0%

Sample

1st rowGas Logs
2nd rowyes
3rd rowyes
4th rowyes
5th rowYes
ValueCountFrequency (%)
yes 71212
53.6%
1 15304
 
11.5%
room 3325
 
2.5%
fireplace 3304
 
2.5%
gas 3127
 
2.4%
2 2535
 
1.9%
not 1993
 
1.5%
applicable 1993
 
1.5%
closets 1725
 
1.3%
wood 1710
 
1.3%
Other values (350) 26724
 
20.1%
2024-11-26T12:57:49.502245image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 93119
18.0%
s 81955
15.8%
y 52257
 
10.1%
29838
 
5.8%
Y 21802
 
4.2%
o 21186
 
4.1%
i 18735
 
3.6%
a 18442
 
3.6%
1 15315
 
3.0%
l 15193
 
2.9%
Other values (59) 150869
29.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 382475
73.7%
Uppercase Letter 74561
 
14.4%
Space Separator 29838
 
5.8%
Decimal Number 19072
 
3.7%
Other Punctuation 10723
 
2.1%
Dash Punctuation 1841
 
0.4%
Math Symbol 71
 
< 0.1%
Open Punctuation 65
 
< 0.1%
Close Punctuation 65
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 93119
24.3%
s 81955
21.4%
y 52257
13.7%
o 21186
 
5.5%
i 18735
 
4.9%
a 18442
 
4.8%
l 15193
 
4.0%
n 14008
 
3.7%
r 12572
 
3.3%
t 12141
 
3.2%
Other values (15) 42867
11.2%
Uppercase Letter
ValueCountFrequency (%)
Y 21802
29.2%
F 8260
 
11.1%
L 5104
 
6.8%
G 4755
 
6.4%
R 4675
 
6.3%
C 4488
 
6.0%
A 3580
 
4.8%
W 3439
 
4.6%
N 3362
 
4.5%
I 3250
 
4.4%
Other values (14) 11846
15.9%
Decimal Number
ValueCountFrequency (%)
1 15315
80.3%
2 2537
 
13.3%
3 619
 
3.2%
0 277
 
1.5%
4 198
 
1.0%
5 65
 
0.3%
6 35
 
0.2%
7 18
 
0.1%
8 5
 
< 0.1%
9 3
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
, 9818
91.6%
/ 734
 
6.8%
# 167
 
1.6%
. 3
 
< 0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
29838
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1841
100.0%
Math Symbol
ValueCountFrequency (%)
+ 71
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 457036
88.1%
Common 61675
 
11.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 93119
20.4%
s 81955
17.9%
y 52257
11.4%
Y 21802
 
4.8%
o 21186
 
4.6%
i 18735
 
4.1%
a 18442
 
4.0%
l 15193
 
3.3%
n 14008
 
3.1%
r 12572
 
2.8%
Other values (39) 107767
23.6%
Common
ValueCountFrequency (%)
29838
48.4%
1 15315
24.8%
, 9818
 
15.9%
2 2537
 
4.1%
- 1841
 
3.0%
/ 734
 
1.2%
3 619
 
1.0%
0 277
 
0.4%
4 198
 
0.3%
# 167
 
0.3%
Other values (10) 331
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 518711
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 93119
18.0%
s 81955
15.8%
y 52257
 
10.1%
29838
 
5.8%
Y 21802
 
4.2%
o 21186
 
4.1%
i 18735
 
3.6%
a 18442
 
3.6%
1 15315
 
3.0%
l 15193
 
2.9%
Other values (59) 150869
29.1%

city
Text

Distinct2026
Distinct (%)0.5%
Missing34
Missing (%)< 0.1%
Memory size2.9 MiB
2024-11-26T12:57:49.813767image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length38
Median length29
Mean length8.9985735
Min length1

Characters and Unicode

Total characters3393821
Distinct characters60
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique435 ?
Unique (%)0.1%

Sample

1st rowSouthern Pines
2nd rowSpokane Valley
3rd rowLos Angeles
4th rowDallas
5th rowPalm Bay
ValueCountFrequency (%)
houston 24460
 
4.8%
miami 20776
 
4.1%
san 19402
 
3.8%
antonio 15592
 
3.1%
fort 11470
 
2.3%
jacksonville 10375
 
2.1%
charlotte 9694
 
1.9%
dallas 8858
 
1.8%
beach 8785
 
1.7%
brooklyn 7298
 
1.4%
Other values (1701) 367717
72.9%
2024-11-26T12:57:50.297680image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 343158
 
10.1%
o 283839
 
8.4%
n 257160
 
7.6%
e 248081
 
7.3%
l 224356
 
6.6%
i 219089
 
6.5%
t 194518
 
5.7%
s 159791
 
4.7%
r 151339
 
4.5%
127326
 
3.8%
Other values (50) 1185164
34.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2677503
78.9%
Uppercase Letter 588826
 
17.3%
Space Separator 127326
 
3.8%
Other Punctuation 94
 
< 0.1%
Dash Punctuation 64
 
< 0.1%
Decimal Number 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 343158
12.8%
o 283839
10.6%
n 257160
9.6%
e 248081
9.3%
l 224356
8.4%
i 219089
8.2%
t 194518
 
7.3%
s 159791
 
6.0%
r 151339
 
5.7%
h 89796
 
3.4%
Other values (16) 506376
18.9%
Uppercase Letter
ValueCountFrequency (%)
S 55874
 
9.5%
C 50725
 
8.6%
A 50275
 
8.5%
P 43885
 
7.5%
H 41896
 
7.1%
M 37157
 
6.3%
L 36547
 
6.2%
B 32977
 
5.6%
D 27228
 
4.6%
O 22869
 
3.9%
Other values (15) 189393
32.2%
Other Punctuation
ValueCountFrequency (%)
. 91
96.8%
' 2
 
2.1%
/ 1
 
1.1%
Decimal Number
ValueCountFrequency (%)
3 2
50.0%
0 2
50.0%
Space Separator
ValueCountFrequency (%)
127326
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3266329
96.2%
Common 127492
 
3.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 343158
 
10.5%
o 283839
 
8.7%
n 257160
 
7.9%
e 248081
 
7.6%
l 224356
 
6.9%
i 219089
 
6.7%
t 194518
 
6.0%
s 159791
 
4.9%
r 151339
 
4.6%
h 89796
 
2.7%
Other values (41) 1095202
33.5%
Common
ValueCountFrequency (%)
127326
99.9%
. 91
 
0.1%
- 64
 
0.1%
3 2
 
< 0.1%
0 2
 
< 0.1%
( 2
 
< 0.1%
) 2
 
< 0.1%
' 2
 
< 0.1%
/ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3393821
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 343158
 
10.1%
o 283839
 
8.4%
n 257160
 
7.6%
e 248081
 
7.3%
l 224356
 
6.6%
i 219089
 
6.5%
t 194518
 
5.7%
s 159791
 
4.7%
r 151339
 
4.5%
127326
 
3.8%
Other values (50) 1185164
34.9%
Distinct297365
Distinct (%)78.8%
Missing0
Missing (%)0.0%
Memory size2.9 MiB
2024-11-26T12:57:50.700723image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length3688
Median length3289
Mean length301.95763
Min length68

Characters and Unicode

Total characters113893888
Distinct characters84
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique265895 ?
Unique (%)70.5%

Sample

1st row[{'rating': ['4', '4', '7', 'NR', '4', '7', 'NR', 'NR'], 'data': {'Distance': ['2.7 mi', '3.6 mi', '5.1 mi', '4.0 mi', '10.5 mi', '12.6 mi', '2.7 mi', '3.1 mi'], 'Grades': ['3–5', '6–8', '9–12', 'PK–2', '6–8', '9–12', 'PK–5', 'K–12']}, 'name': ['Southern Pines Elementary School', 'Southern Middle School', 'Pinecrest High School', 'Southern Pines Primary School', "Crain's Creek Middle School", 'Union Pines High School', 'Episcopal Day Private School', 'Calvary Christian Private School']}]
2nd row[{'rating': ['4/10', 'None/10', '4/10'], 'data': {'Distance': ['1.65mi', '1.32mi', '1.01mi'], 'Grades': ['9-12', '3-8', 'PK-8']}, 'name': ['East Valley High School&Extension', 'Eastvalley Middle School', 'Trentwood Elementary School']}]
3rd row[{'rating': ['8/10', '4/10', '8/10'], 'data': {'Distance': ['1.19mi', '2.06mi', '2.63mi'], 'Grades': ['6-8', 'K-5', '9-12']}, 'name': ['Paul Revere Middle School', 'Brentwood Science School', 'Palisades Charter High School']}]
4th row[{'rating': ['9/10', '9/10', '10/10', '9/10'], 'data': {'Distance': ['1.05mi', '0.1mi', '1.05mi', '0.81mi'], 'Grades': ['5-6', 'PK-4', '7-8', '9-12']}, 'name': ['Mcculloch Intermediate School', 'Bradfield Elementary School', 'Highland Park Middle School', 'Highland Park High School']}]
5th row[{'rating': ['4/10', '5/10', '5/10'], 'data': {'Distance': ['5.96mi', '3.25mi', '3.03mi'], 'Grades': ['7-8', '9-12', 'PK-6']}, 'name': ['Southwest Middle School', 'Bayside High School', 'Westside Elementary School']}]
ValueCountFrequency (%)
school 1414258
 
9.8%
mi 906984
 
6.3%
elementary 446657
 
3.1%
high 438612
 
3.0%
name 377355
 
2.6%
grades 377288
 
2.6%
rating 377185
 
2.6%
data 377185
 
2.6%
distance 377185
 
2.6%
middle 322399
 
2.2%
Other values (16026) 9085018
62.7%
2024-11-26T12:57:51.520546image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
' 16750056
 
14.7%
14127354
 
12.4%
, 6138741
 
5.4%
e 4777468
 
4.2%
o 4683179
 
4.1%
a 4603527
 
4.0%
i 4521043
 
4.0%
l 3313909
 
2.9%
t 3119253
 
2.7%
n 3061675
 
2.7%
Other values (74) 48797683
42.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 46326910
40.7%
Other Punctuation 27492645
24.1%
Space Separator 14127354
 
12.4%
Decimal Number 10672263
 
9.4%
Uppercase Letter 8555800
 
7.5%
Open Punctuation 2646690
 
2.3%
Close Punctuation 2646689
 
2.3%
Dash Punctuation 1425441
 
1.3%
Math Symbol 96
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 4777468
10.3%
o 4683179
10.1%
a 4603527
9.9%
i 4521043
9.8%
l 3313909
 
7.2%
t 3119253
 
6.7%
n 3061675
 
6.6%
r 2990290
 
6.5%
m 2887012
 
6.2%
h 2482228
 
5.4%
Other values (17) 9887326
21.3%
Uppercase Letter
ValueCountFrequency (%)
S 1736884
20.3%
P 823370
9.6%
K 700928
 
8.2%
H 627414
 
7.3%
M 588358
 
6.9%
E 573335
 
6.7%
D 489345
 
5.7%
G 481790
 
5.6%
C 418547
 
4.9%
A 318764
 
3.7%
Other values (16) 1797065
21.0%
Other Punctuation
ValueCountFrequency (%)
' 16750056
60.9%
, 6138741
 
22.3%
: 1887363
 
6.9%
. 1691659
 
6.2%
/ 986866
 
3.6%
" 22426
 
0.1%
& 9112
 
< 0.1%
@ 5683
 
< 0.1%
# 532
 
< 0.1%
\ 198
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 2443792
22.9%
0 1678654
15.7%
2 1271285
11.9%
8 942715
 
8.8%
5 926544
 
8.7%
6 872241
 
8.2%
9 788850
 
7.4%
3 650120
 
6.1%
4 598916
 
5.6%
7 499146
 
4.7%
Close Punctuation
ValueCountFrequency (%)
] 1885925
71.3%
} 754370
 
28.5%
) 6394
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 1885925
71.3%
{ 754370
 
28.5%
( 6395
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 1047339
73.5%
378102
 
26.5%
Space Separator
ValueCountFrequency (%)
14127354
100.0%
Math Symbol
ValueCountFrequency (%)
+ 96
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 59011178
51.8%
Latin 54882710
48.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 4777468
 
8.7%
o 4683179
 
8.5%
a 4603527
 
8.4%
i 4521043
 
8.2%
l 3313909
 
6.0%
t 3119253
 
5.7%
n 3061675
 
5.6%
r 2990290
 
5.4%
m 2887012
 
5.3%
h 2482228
 
4.5%
Other values (43) 18443126
33.6%
Common
ValueCountFrequency (%)
' 16750056
28.4%
14127354
23.9%
, 6138741
 
10.4%
1 2443792
 
4.1%
: 1887363
 
3.2%
] 1885925
 
3.2%
[ 1885925
 
3.2%
. 1691659
 
2.9%
0 1678654
 
2.8%
2 1271285
 
2.2%
Other values (21) 9250424
15.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 113515768
99.7%
Punctuation 378102
 
0.3%
None 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
' 16750056
 
14.8%
14127354
 
12.4%
, 6138741
 
5.4%
e 4777468
 
4.2%
o 4683179
 
4.1%
a 4603527
 
4.1%
i 4521043
 
4.0%
l 3313909
 
2.9%
t 3119253
 
2.7%
n 3061675
 
2.7%
Other values (72) 48419563
42.7%
Punctuation
ValueCountFrequency (%)
378102
100.0%
None
ValueCountFrequency (%)
é 18
100.0%

sqft
Text

Missing 

Distinct25405
Distinct (%)7.5%
Missing40577
Missing (%)10.8%
Memory size2.9 MiB
2024-11-26T12:57:51.901360image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length41
Median length40
Mean length9.3000196
Min length1

Characters and Unicode

Total characters3130461
Distinct characters28
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7325 ?
Unique (%)2.2%

Sample

1st row2900
2nd row1,947 sqft
3rd row3,000 sqft
4th row6,457 sqft
5th row897 sqft
ValueCountFrequency (%)
sqft 182737
29.8%
area 23678
 
3.9%
livable 23678
 
3.9%
interior 23678
 
3.9%
total 23678
 
3.9%
0 11854
 
1.9%
1,200 1298
 
0.2%
1,000 955
 
0.2%
1,500 911
 
0.1%
1,100 879
 
0.1%
Other values (14448) 320711
52.2%
2024-11-26T12:57:52.448729image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
277449
 
8.9%
, 253135
 
8.1%
1 232504
 
7.4%
t 230093
 
7.4%
2 184940
 
5.9%
s 182737
 
5.8%
q 182737
 
5.8%
f 182737
 
5.8%
0 170258
 
5.4%
3 115328
 
3.7%
Other values (18) 1118543
35.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1275542
40.7%
Decimal Number 1275338
40.7%
Space Separator 277449
 
8.9%
Other Punctuation 276813
 
8.8%
Uppercase Letter 23678
 
0.8%
Dash Punctuation 1641
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 230093
18.0%
s 182737
14.3%
q 182737
14.3%
f 182737
14.3%
a 94712
7.4%
l 71034
 
5.6%
i 71034
 
5.6%
e 71034
 
5.6%
r 71034
 
5.6%
o 47356
 
3.7%
Other values (3) 71034
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 232504
18.2%
2 184940
14.5%
0 170258
13.4%
3 115328
9.0%
4 108611
8.5%
5 99097
7.8%
6 98967
7.8%
8 98184
7.7%
7 84992
 
6.7%
9 82457
 
6.5%
Other Punctuation
ValueCountFrequency (%)
, 253135
91.4%
: 23678
 
8.6%
Space Separator
ValueCountFrequency (%)
277449
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 23678
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1641
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1831241
58.5%
Latin 1299220
41.5%

Most frequent character per script

Common
ValueCountFrequency (%)
277449
15.2%
, 253135
13.8%
1 232504
12.7%
2 184940
10.1%
0 170258
9.3%
3 115328
6.3%
4 108611
 
5.9%
5 99097
 
5.4%
6 98967
 
5.4%
8 98184
 
5.4%
Other values (4) 192768
10.5%
Latin
ValueCountFrequency (%)
t 230093
17.7%
s 182737
14.1%
q 182737
14.1%
f 182737
14.1%
a 94712
7.3%
l 71034
 
5.5%
i 71034
 
5.5%
e 71034
 
5.5%
r 71034
 
5.5%
o 47356
 
3.6%
Other values (4) 94712
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3130461
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
277449
 
8.9%
, 253135
 
8.1%
1 232504
 
7.4%
t 230093
 
7.4%
2 184940
 
5.9%
s 182737
 
5.8%
q 182737
 
5.8%
f 182737
 
5.8%
0 170258
 
5.4%
3 115328
 
3.7%
Other values (18) 1118543
35.7%
Distinct4549
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.9 MiB
2024-11-26T12:57:52.820618image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length10
Median length5
Mean length4.9980593
Min length1

Characters and Unicode

Total characters1885193
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique598 ?
Unique (%)0.2%

Sample

1st row28387
2nd row99216
3rd row90049
4th row75205
5th row32908
ValueCountFrequency (%)
32137 2141
 
0.6%
33131 1563
 
0.4%
34747 1488
 
0.4%
78245 1390
 
0.4%
34759 1333
 
0.4%
33132 1328
 
0.4%
33137 1308
 
0.3%
78253 1282
 
0.3%
78254 1238
 
0.3%
33130 1170
 
0.3%
Other values (4539) 362944
96.2%
2024-11-26T12:57:53.353750image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 336087
17.8%
1 233614
12.4%
2 233487
12.4%
7 230508
12.2%
0 221306
11.7%
4 151634
8.0%
8 148263
7.9%
9 120899
 
6.4%
6 105045
 
5.6%
5 104106
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1884949
> 99.9%
Dash Punctuation 244
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 336087
17.8%
1 233614
12.4%
2 233487
12.4%
7 230508
12.2%
0 221306
11.7%
4 151634
8.0%
8 148263
7.9%
9 120899
 
6.4%
6 105045
 
5.6%
5 104106
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 244
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1885193
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 336087
17.8%
1 233614
12.4%
2 233487
12.4%
7 230508
12.2%
0 221306
11.7%
4 151634
8.0%
8 148263
7.9%
9 120899
 
6.4%
6 105045
 
5.6%
5 104106
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1885193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 336087
17.8%
1 233614
12.4%
2 233487
12.4%
7 230508
12.2%
0 221306
11.7%
4 151634
8.0%
8 148263
7.9%
9 120899
 
6.4%
6 105045
 
5.6%
5 104106
 
5.5%

beds
Text

Missing 

Distinct1184
Distinct (%)0.4%
Missing91282
Missing (%)24.2%
Memory size2.9 MiB
2024-11-26T12:57:53.531563image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length122
Median length121
Mean length4.1196
Min length1

Characters and Unicode

Total characters1177806
Distinct characters56
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique715 ?
Unique (%)0.3%

Sample

1st row4
2nd row3 Beds
3rd row3 Beds
4th row5 Beds
5th row2 Beds
ValueCountFrequency (%)
beds 133199
29.3%
3 97751
21.5%
4 63717
14.0%
2 47731
 
10.5%
bd 32127
 
7.1%
5 20337
 
4.5%
baths 15283
 
3.4%
3.0 8088
 
1.8%
6 6276
 
1.4%
1 5744
 
1.3%
Other values (1126) 23708
 
5.2%
2024-11-26T12:57:53.964181image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
168960
14.3%
d 165341
14.0%
s 151538
12.9%
B 149236
12.7%
e 134898
11.5%
3 106736
9.1%
4 69967
5.9%
2 51383
 
4.4%
b 32131
 
2.7%
5 22785
 
1.9%
Other values (46) 124831
10.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 541541
46.0%
Decimal Number 294668
25.0%
Space Separator 168960
 
14.3%
Uppercase Letter 149291
 
12.7%
Other Punctuation 21120
 
1.8%
Dash Punctuation 2224
 
0.2%
Currency Symbol 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 165341
30.5%
s 151538
28.0%
e 134898
24.9%
b 32131
 
5.9%
a 17696
 
3.3%
t 17489
 
3.2%
h 16041
 
3.0%
r 1677
 
0.3%
c 1649
 
0.3%
f 1441
 
0.3%
Other values (11) 1640
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
B 149236
> 99.9%
R 19
 
< 0.1%
L 5
 
< 0.1%
O 4
 
< 0.1%
M 4
 
< 0.1%
E 3
 
< 0.1%
I 3
 
< 0.1%
K 3
 
< 0.1%
D 3
 
< 0.1%
C 2
 
< 0.1%
Other values (6) 9
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
3 106736
36.2%
4 69967
23.7%
2 51383
17.4%
5 22785
 
7.7%
0 22008
 
7.5%
1 8319
 
2.8%
6 7475
 
2.5%
7 2707
 
0.9%
8 2019
 
0.7%
9 1269
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 19750
93.5%
, 1359
 
6.4%
' 4
 
< 0.1%
/ 4
 
< 0.1%
% 2
 
< 0.1%
# 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
168960
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2224
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 690832
58.7%
Common 486974
41.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 165341
23.9%
s 151538
21.9%
B 149236
21.6%
e 134898
19.5%
b 32131
 
4.7%
a 17696
 
2.6%
t 17489
 
2.5%
h 16041
 
2.3%
r 1677
 
0.2%
c 1649
 
0.2%
Other values (27) 3136
 
0.5%
Common
ValueCountFrequency (%)
168960
34.7%
3 106736
21.9%
4 69967
14.4%
2 51383
 
10.6%
5 22785
 
4.7%
0 22008
 
4.5%
. 19750
 
4.1%
1 8319
 
1.7%
6 7475
 
1.5%
7 2707
 
0.6%
Other values (9) 6884
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1177806
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
168960
14.3%
d 165341
14.0%
s 151538
12.9%
B 149236
12.7%
e 134898
11.5%
3 106736
9.1%
4 69967
5.9%
2 51383
 
4.4%
b 32131
 
2.7%
5 22785
 
1.9%
Other values (46) 124831
10.6%

state
Categorical

Distinct39
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.9 MiB
FL
115449 
TX
83786 
NY
24479 
CA
23386 
NC
21862 
Other values (34)
108223 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters754370
Distinct characters26
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st rowNC
2nd rowWA
3rd rowCA
4th rowTX
5th rowFL

Common Values

ValueCountFrequency (%)
FL 115449
30.6%
TX 83786
22.2%
NY 24479
 
6.5%
CA 23386
 
6.2%
NC 21862
 
5.8%
TN 18340
 
4.9%
WA 13826
 
3.7%
OH 12588
 
3.3%
IL 8939
 
2.4%
NV 8482
 
2.2%
Other values (29) 46048
 
12.2%

Length

2024-11-26T12:57:54.102647image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
fl 115450
30.6%
tx 83786
22.2%
ny 24479
 
6.5%
ca 23386
 
6.2%
nc 21862
 
5.8%
tn 18340
 
4.9%
wa 13826
 
3.7%
oh 12588
 
3.3%
il 8939
 
2.4%
nv 8482
 
2.2%
Other values (28) 46047
 
12.2%

Most occurring characters

ValueCountFrequency (%)
L 124389
16.5%
F 115450
15.3%
T 104327
13.8%
X 83786
11.1%
N 76927
10.2%
C 56354
7.5%
A 55386
7.3%
Y 24569
 
3.3%
O 22698
 
3.0%
I 18122
 
2.4%
Other values (16) 72362
9.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 754369
> 99.9%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
L 124389
16.5%
F 115450
15.3%
T 104327
13.8%
X 83786
11.1%
N 76927
10.2%
C 56354
7.5%
A 55386
7.3%
Y 24569
 
3.3%
O 22698
 
3.0%
I 18122
 
2.4%
Other values (15) 72361
9.6%
Lowercase Letter
ValueCountFrequency (%)
l 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 754370
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
L 124389
16.5%
F 115450
15.3%
T 104327
13.8%
X 83786
11.1%
N 76927
10.2%
C 56354
7.5%
A 55386
7.3%
Y 24569
 
3.3%
O 22698
 
3.0%
I 18122
 
2.4%
Other values (16) 72362
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 754370
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
L 124389
16.5%
F 115450
15.3%
T 104327
13.8%
X 83786
11.1%
N 76927
10.2%
C 56354
7.5%
A 55386
7.3%
Y 24569
 
3.3%
O 22698
 
3.0%
I 18122
 
2.4%
Other values (16) 72362
9.6%

stories
Text

Missing 

Distinct347
Distinct (%)0.2%
Missing150716
Missing (%)40.0%
Memory size2.9 MiB
2024-11-26T12:57:54.436060image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length42
Median length3
Mean length2.852439
Min length1

Characters and Unicode

Total characters645989
Distinct characters61
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)< 0.1%

Sample

1st row2.0
2nd row1.0
3rd row3.0
4th row2.0
5th rowOne
ValueCountFrequency (%)
1.0 67454
28.5%
2.0 55283
23.4%
1 24830
 
10.5%
2 20958
 
8.9%
3.0 11275
 
4.8%
0.0 7241
 
3.1%
one 6367
 
2.7%
3 5398
 
2.3%
story 4718
 
2.0%
0 4273
 
1.8%
Other values (266) 28640
12.1%
2024-11-26T12:57:54.954143image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 172813
26.8%
. 155919
24.1%
1 96437
14.9%
2 80167
12.4%
3 17634
 
2.7%
e 14174
 
2.2%
o 11316
 
1.8%
9968
 
1.5%
r 8570
 
1.3%
t 8164
 
1.3%
Other values (51) 70827
11.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 378924
58.7%
Other Punctuation 157572
24.4%
Lowercase Letter 75369
 
11.7%
Uppercase Letter 22948
 
3.6%
Space Separator 9968
 
1.5%
Math Symbol 886
 
0.1%
Dash Punctuation 320
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 14174
18.8%
o 11316
15.0%
r 8570
11.4%
t 8164
10.8%
n 8141
10.8%
y 5014
 
6.7%
i 3360
 
4.5%
w 3173
 
4.2%
l 3047
 
4.0%
u 1598
 
2.1%
Other values (13) 8812
11.7%
Uppercase Letter
ValueCountFrequency (%)
O 7070
30.8%
S 6408
27.9%
T 3933
17.1%
L 1815
 
7.9%
M 1412
 
6.2%
B 890
 
3.9%
R 614
 
2.7%
C 299
 
1.3%
A 217
 
0.9%
F 99
 
0.4%
Other values (10) 191
 
0.8%
Decimal Number
ValueCountFrequency (%)
0 172813
45.6%
1 96437
25.5%
2 80167
21.2%
3 17634
 
4.7%
9 3512
 
0.9%
4 3464
 
0.9%
5 2539
 
0.7%
6 1251
 
0.3%
7 639
 
0.2%
8 468
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 155919
99.0%
/ 980
 
0.6%
, 673
 
0.4%
Space Separator
ValueCountFrequency (%)
9968
100.0%
Math Symbol
ValueCountFrequency (%)
+ 886
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 320
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 547672
84.8%
Latin 98317
 
15.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 14174
14.4%
o 11316
11.5%
r 8570
 
8.7%
t 8164
 
8.3%
n 8141
 
8.3%
O 7070
 
7.2%
S 6408
 
6.5%
y 5014
 
5.1%
T 3933
 
4.0%
i 3360
 
3.4%
Other values (33) 22167
22.5%
Common
ValueCountFrequency (%)
0 172813
31.6%
. 155919
28.5%
1 96437
17.6%
2 80167
14.6%
3 17634
 
3.2%
9968
 
1.8%
9 3512
 
0.6%
4 3464
 
0.6%
5 2539
 
0.5%
6 1251
 
0.2%
Other values (8) 3968
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 645989
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 172813
26.8%
. 155919
24.1%
1 96437
14.9%
2 80167
12.4%
3 17634
 
2.7%
e 14174
 
2.2%
o 11316
 
1.8%
9968
 
1.5%
r 8570
 
1.3%
t 8164
 
1.3%
Other values (51) 70827
11.0%

mls-id
Text

Missing 

Distinct24907
Distinct (%)99.9%
Missing352243
Missing (%)93.4%
Memory size2.9 MiB
2024-11-26T12:57:55.249998image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length24
Median length12
Mean length7.8794002
Min length1

Characters and Unicode

Total characters196528
Distinct characters57
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24874 ?
Unique (%)99.7%

Sample

1st row19221142
2nd rowSR19195113
3rd row201909438
4th rowT3159863
5th rowT3204536
ValueCountFrequency (%)
no 8
 
< 0.1%
mls 5
 
< 0.1%
983469 2
 
< 0.1%
241766 2
 
< 0.1%
201906177 2
 
< 0.1%
74184012 2
 
< 0.1%
1020414 2
 
< 0.1%
19-5064 2
 
< 0.1%
201909981 2
 
< 0.1%
617190 2
 
< 0.1%
Other values (24897) 24922
99.9%
2024-11-26T12:57:55.682830image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 31390
16.0%
0 22606
11.5%
2 21470
10.9%
9 18569
9.4%
4 17233
8.8%
5 15664
8.0%
6 14750
7.5%
3 14566
7.4%
7 14516
7.4%
8 14386
7.3%
Other values (47) 11378
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 185150
94.2%
Uppercase Letter 10666
 
5.4%
Dash Punctuation 643
 
0.3%
Lowercase Letter 50
 
< 0.1%
Space Separator 12
 
< 0.1%
Other Punctuation 7
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 2251
21.1%
C 1139
10.7%
D 811
 
7.6%
F 668
 
6.3%
P 612
 
5.7%
R 600
 
5.6%
T 551
 
5.2%
S 539
 
5.1%
U 506
 
4.7%
O 413
 
3.9%
Other values (16) 2576
24.2%
Lowercase Letter
ValueCountFrequency (%)
o 8
16.0%
e 5
10.0%
d 5
10.0%
s 4
 
8.0%
f 4
 
8.0%
c 4
 
8.0%
w 3
 
6.0%
a 3
 
6.0%
t 2
 
4.0%
b 2
 
4.0%
Other values (7) 10
20.0%
Decimal Number
ValueCountFrequency (%)
1 31390
17.0%
0 22606
12.2%
2 21470
11.6%
9 18569
10.0%
4 17233
9.3%
5 15664
8.5%
6 14750
8.0%
3 14566
7.9%
7 14516
7.8%
8 14386
7.8%
Other Punctuation
ValueCountFrequency (%)
: 4
57.1%
# 3
42.9%
Dash Punctuation
ValueCountFrequency (%)
- 643
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 185812
94.5%
Latin 10716
 
5.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 2251
21.0%
C 1139
10.6%
D 811
 
7.6%
F 668
 
6.2%
P 612
 
5.7%
R 600
 
5.6%
T 551
 
5.1%
S 539
 
5.0%
U 506
 
4.7%
O 413
 
3.9%
Other values (33) 2626
24.5%
Common
ValueCountFrequency (%)
1 31390
16.9%
0 22606
12.2%
2 21470
11.6%
9 18569
10.0%
4 17233
9.3%
5 15664
8.4%
6 14750
7.9%
3 14566
7.8%
7 14516
7.8%
8 14386
7.7%
Other values (4) 662
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 196528
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 31390
16.0%
0 22606
11.5%
2 21470
10.9%
9 18569
9.4%
4 17233
8.8%
5 15664
8.0%
6 14750
7.5%
3 14566
7.4%
7 14516
7.4%
8 14386
7.3%
Other values (47) 11378
 
5.8%

PrivatePool
Boolean

Constant  Missing 

Distinct1
Distinct (%)< 0.1%
Missing336874
Missing (%)89.3%
Memory size736.8 KiB
True
40311 
(Missing)
336874 
ValueCountFrequency (%)
True 40311
 
10.7%
(Missing) 336874
89.3%
2024-11-26T12:57:55.820788image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

MlsId
Text

Missing 

Distinct232944
Distinct (%)75.1%
Missing66880
Missing (%)17.7%
Memory size2.9 MiB
2024-11-26T12:57:56.270442image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length60
Median length58
Mean length8.0889576
Min length1

Characters and Unicode

Total characters2510044
Distinct characters76
Distinct categories12 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique171673 ?
Unique (%)55.3%

Sample

1st row611019
2nd row201916904
3rd rowFR19221027
4th row14191809
5th row861745
ValueCountFrequency (%)
fl 2265
 
0.7%
miami 884
 
0.3%
tx 772
 
0.2%
beach 351
 
0.1%
houston 332
 
0.1%
orlando 302
 
0.1%
lauderdale 246
 
0.1%
fort 241
 
0.1%
ca 213
 
0.1%
austin 206
 
0.1%
Other values (232028) 319295
98.2%
2024-11-26T12:57:56.952008image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 359697
14.3%
0 275829
11.0%
2 243681
9.7%
4 222316
8.9%
5 215575
8.6%
3 197981
7.9%
9 194840
7.8%
7 194113
7.7%
6 193488
7.7%
8 187133
7.5%
Other values (66) 225391
9.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2284653
91.0%
Uppercase Letter 160344
 
6.4%
Lowercase Letter 32330
 
1.3%
Space Separator 18956
 
0.8%
Other Punctuation 8830
 
0.4%
Dash Punctuation 4918
 
0.2%
Currency Symbol 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Modifier Symbol 2
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 37284
23.3%
C 17144
10.7%
D 12958
 
8.1%
O 11635
 
7.3%
P 11002
 
6.9%
F 9517
 
5.9%
T 7678
 
4.8%
S 6952
 
4.3%
R 6579
 
4.1%
H 6387
 
4.0%
Other values (16) 33208
20.7%
Lowercase Letter
ValueCountFrequency (%)
a 4653
14.4%
i 3275
10.1%
o 3169
9.8%
n 2792
8.6%
e 2734
8.5%
t 2299
 
7.1%
r 2128
 
6.6%
l 2103
 
6.5%
s 1808
 
5.6%
d 1250
 
3.9%
Other values (13) 6119
18.9%
Decimal Number
ValueCountFrequency (%)
1 359697
15.7%
0 275829
12.1%
2 243681
10.7%
4 222316
9.7%
5 215575
9.4%
3 197981
8.7%
9 194840
8.5%
7 194113
8.5%
6 193488
8.5%
8 187133
8.2%
Other Punctuation
ValueCountFrequency (%)
, 8669
98.2%
/ 49
 
0.6%
: 47
 
0.5%
* 26
 
0.3%
# 18
 
0.2%
. 11
 
0.1%
& 7
 
0.1%
! 2
 
< 0.1%
; 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18956
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4918
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2317370
92.3%
Latin 192674
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 37284
19.4%
C 17144
 
8.9%
D 12958
 
6.7%
O 11635
 
6.0%
P 11002
 
5.7%
F 9517
 
4.9%
T 7678
 
4.0%
S 6952
 
3.6%
R 6579
 
3.4%
H 6387
 
3.3%
Other values (39) 65538
34.0%
Common
ValueCountFrequency (%)
1 359697
15.5%
0 275829
11.9%
2 243681
10.5%
4 222316
9.6%
5 215575
9.3%
3 197981
8.5%
9 194840
8.4%
7 194113
8.4%
6 193488
8.3%
8 187133
8.1%
Other values (17) 32717
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2510044
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 359697
14.3%
0 275829
11.0%
2 243681
9.7%
4 222316
8.9%
5 215575
8.6%
3 197981
7.9%
9 194840
7.8%
7 194113
7.7%
6 193488
7.7%
8 187133
7.5%
Other values (66) 225391
9.0%

target
Text

Distinct43939
Distinct (%)11.7%
Missing2481
Missing (%)0.7%
Memory size2.9 MiB
2024-11-26T12:57:57.332189image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length18
Median length8
Mean length7.9461522
Min length1

Characters and Unicode

Total characters2977455
Distinct characters18
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29774 ?
Unique (%)7.9%

Sample

1st row$418,000
2nd row$310,000
3rd row$2,895,000
4th row$2,395,000
5th row$5,000
ValueCountFrequency (%)
225,000 1806
 
0.5%
275,000 1650
 
0.4%
250,000 1644
 
0.4%
350,000 1641
 
0.4%
325,000 1562
 
0.4%
399,000 1547
 
0.4%
299,900 1534
 
0.4%
249,900 1500
 
0.4%
299,000 1452
 
0.4%
375,000 1442
 
0.4%
Other values (34326) 358928
95.8%
2024-11-26T12:57:57.823609image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 984828
33.1%
, 418551
14.1%
9 332567
 
11.2%
$ 308823
 
10.4%
5 183665
 
6.2%
2 147747
 
5.0%
1 138827
 
4.7%
4 121560
 
4.1%
3 110323
 
3.7%
7 77982
 
2.6%
Other values (8) 152582
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2241621
75.3%
Other Punctuation 418949
 
14.1%
Currency Symbol 308823
 
10.4%
Math Symbol 7263
 
0.2%
Lowercase Letter 796
 
< 0.1%
Space Separator 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 984828
43.9%
9 332567
 
14.8%
5 183665
 
8.2%
2 147747
 
6.6%
1 138827
 
6.2%
4 121560
 
5.4%
3 110323
 
4.9%
7 77982
 
3.5%
8 75823
 
3.4%
6 68299
 
3.0%
Other Punctuation
ValueCountFrequency (%)
, 418551
99.9%
/ 398
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
m 398
50.0%
o 398
50.0%
Currency Symbol
ValueCountFrequency (%)
$ 308823
100.0%
Math Symbol
ValueCountFrequency (%)
+ 7263
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2976659
> 99.9%
Latin 796
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 984828
33.1%
, 418551
14.1%
9 332567
 
11.2%
$ 308823
 
10.4%
5 183665
 
6.2%
2 147747
 
5.0%
1 138827
 
4.7%
4 121560
 
4.1%
3 110323
 
3.7%
7 77982
 
2.6%
Other values (6) 151786
 
5.1%
Latin
ValueCountFrequency (%)
m 398
50.0%
o 398
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2977455
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 984828
33.1%
, 418551
14.1%
9 332567
 
11.2%
$ 308823
 
10.4%
5 183665
 
6.2%
2 147747
 
5.0%
1 138827
 
4.7%
4 121560
 
4.1%
3 110323
 
3.7%
7 77982
 
2.6%
Other values (8) 152582
 
5.1%

Missing values

2024-11-26T12:57:39.463203image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-11-26T12:57:40.282683image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-11-26T12:57:42.498319image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

statusprivate poolpropertyTypestreetbathshomeFactsfireplacecityschoolssqftzipcodebedsstatestoriesmls-idPrivatePoolMlsIdtarget
0ActiveNaNSingle Family Home240 Heather Ln3.5{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Central A/C, Heat Pump', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': None, 'factLabel': 'lotsize'}, {'factValue': '$144', 'factLabel': 'Price/sqft'}]}Gas LogsSouthern Pines[{'rating': ['4', '4', '7', 'NR', '4', '7', 'NR', 'NR'], 'data': {'Distance': ['2.7 mi', '3.6 mi', '5.1 mi', '4.0 mi', '10.5 mi', '12.6 mi', '2.7 mi', '3.1 mi'], 'Grades': ['3–5', '6–8', '9–12', 'PK–2', '6–8', '9–12', 'PK–5', 'K–12']}, 'name': ['Southern Pines Elementary School', 'Southern Middle School', 'Pinecrest High School', 'Southern Pines Primary School', "Crain's Creek Middle School", 'Union Pines High School', 'Episcopal Day Private School', 'Calvary Christian Private School']}]2900283874NCNaNNaNNaN611019$418,000
1for saleNaNsingle-family home12911 E Heroy Ave3 Baths{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '5828 sqft', 'factLabel': 'lotsize'}, {'factValue': '$159/sqft', 'factLabel': 'Price/sqft'}]}NaNSpokane Valley[{'rating': ['4/10', 'None/10', '4/10'], 'data': {'Distance': ['1.65mi', '1.32mi', '1.01mi'], 'Grades': ['9-12', '3-8', 'PK-8']}, 'name': ['East Valley High School&Extension', 'Eastvalley Middle School', 'Trentwood Elementary School']}]1,947 sqft992163 BedsWA2.0NaNNaN201916904$310,000
2for saleNaNsingle-family home2005 Westridge Rd2 Baths{'atAGlanceFacts': [{'factValue': '1961', 'factLabel': 'Year built'}, {'factValue': '1967', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Attached Garage', 'factLabel': 'Parking'}, {'factValue': '8,626 sqft', 'factLabel': 'lotsize'}, {'factValue': '$965/sqft', 'factLabel': 'Price/sqft'}]}yesLos Angeles[{'rating': ['8/10', '4/10', '8/10'], 'data': {'Distance': ['1.19mi', '2.06mi', '2.63mi'], 'Grades': ['6-8', 'K-5', '9-12']}, 'name': ['Paul Revere Middle School', 'Brentwood Science School', 'Palisades Charter High School']}]3,000 sqft900493 BedsCA1.0NaNyesFR19221027$2,895,000
3for saleNaNsingle-family home4311 Livingston Ave8 Baths{'atAGlanceFacts': [{'factValue': '2006', 'factLabel': 'Year built'}, {'factValue': '2006', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Detached Garage', 'factLabel': 'Parking'}, {'factValue': '8,220 sqft', 'factLabel': 'lotsize'}, {'factValue': '$371/sqft', 'factLabel': 'Price/sqft'}]}yesDallas[{'rating': ['9/10', '9/10', '10/10', '9/10'], 'data': {'Distance': ['1.05mi', '0.1mi', '1.05mi', '0.81mi'], 'Grades': ['5-6', 'PK-4', '7-8', '9-12']}, 'name': ['Mcculloch Intermediate School', 'Bradfield Elementary School', 'Highland Park Middle School', 'Highland Park High School']}]6,457 sqft752055 BedsTX3.0NaNNaN14191809$2,395,000
4for saleNaNlot/land1524 Kiscoe StNaN{'atAGlanceFacts': [{'factValue': '', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '10,019 sqft', 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}NaNPalm Bay[{'rating': ['4/10', '5/10', '5/10'], 'data': {'Distance': ['5.96mi', '3.25mi', '3.03mi'], 'Grades': ['7-8', '9-12', 'PK-6']}, 'name': ['Southwest Middle School', 'Bayside High School', 'Westside Elementary School']}]NaN32908NaNFLNaNNaNNaN861745$5,000
5for saleNaNtownhouse1624 S Newkirk StNaN{'atAGlanceFacts': [{'factValue': '1920', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '680 sqft', 'factLabel': 'lotsize'}, {'factValue': '$233/sqft', 'factLabel': 'Price/sqft'}]}NaNPhiladelphia[{'rating': [], 'data': {'Distance': [], 'Grades': []}, 'name': []}]897 sqft191452 BedsPA2.0NaNNaNPAPH847006$209,000
6ActiveNaNFlorida552 Casanova CtNaN{'atAGlanceFacts': [{'factValue': '2006', 'factLabel': 'Year built'}, {'factValue': '2006', 'factLabel': 'Remodeled year'}, {'factValue': 'Electric, Heat Pump', 'factLabel': 'Heating'}, {'factValue': 'Central Air', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '4,996 Sq. Ft.', 'factLabel': 'lotsize'}, {'factValue': '$120 / Sq. Ft.', 'factLabel': 'Price/sqft'}]}NaNPOINCIANA[{'rating': ['3', '3', '1', 'NR'], 'data': {'Distance': ['0.8 mi', '8.3 mi', '4.2 mi', '2.0 mi'], 'Grades': ['Preschool to 4', 'Preschool to 12', '5 to 8', '1 to 12']}, 'name': ['Palmetto Elementary School', 'Haines City Senior High School', 'Lake Marion Creek Elementary School', 'Chosen Generation Christian Academy']}]1,50734759NaNFLOneNaNNaNS5026943181,500
7ActiveNaNNaN6094 Mingle DrNaN{'atAGlanceFacts': [{'factValue': '1976', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '8,750 Sq. Ft.', 'factLabel': 'lotsize'}, {'factValue': '$57 / Sq. Ft.', 'factLabel': 'Price/sqft'}]}NaNMemphis[{'rating': ['4', '2', '2'], 'data': {'Distance': ['0.7 mi', '0.4 mi', '2.2 mi'], 'Grades': ['Preschool to 5', '6 to 8', '9 to 12']}, 'name': ['Crump Elementary School', 'Hickory Ridge Middle School', 'Wooddale High School']}]NaN38115NaNTNNaNNaNNaN1006350668,000
8ActiveNaNSingle Family Home11182 Owl Ave2{'atAGlanceFacts': [{'factValue': '1970', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '124582', 'factLabel': 'lotsize'}, {'factValue': '$68', 'factLabel': 'Price/sqft'}]}NaNMason City[{'rating': ['2', '2', '4', '7', '4', 'NR'], 'data': {'Distance': ['5.6 mi', '5.6 mi', '6.8 mi', '6.5 mi', '6.8 mi', '6.8 mi'], 'Grades': ['PK–4', '5–6', '9–12', 'PK–4', '7–8', '9–12']}, 'name': ['Roosevelt Elementary School', 'Lincoln Intermediate School', 'Mason City High School', 'Jefferson Elementary School', 'John Adams Middle School', 'Alternative School']}]3588504013IANaNNaNNaN190988$244,900
9NaNNaNSingle Family8612 Cedar Plains Ln3{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': 'Gas', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Attached Garage', 'factLabel': 'Parking'}, {'factValue': '2,056 sqft', 'factLabel': 'lotsize'}, {'factValue': '$162', 'factLabel': 'Price/sqft'}]}NaNHouston[{'rating': ['4/10', '3/10', '2/10'], 'data': {'Distance': ['0.7 mi', '0.6 mi', '1.9 mi'], 'Grades': ['PK-5', '5-8', '9-12']}, 'name': ['Edgewood Elementary School', 'Landrum Middle School', 'Northbrook High School']}]1,930770803TX2.0NaNNaN73968331$311,995
statusprivate poolpropertyTypestreetbathshomeFactsfireplacecityschoolssqftzipcodebedsstatestoriesmls-idPrivatePoolMlsIdtarget
377175for saleNaNsingle-family home9711 Lawngate Dr3 Baths{'atAGlanceFacts': [{'factValue': '1970', 'factLabel': 'Year built'}, {'factValue': '1970', 'factLabel': 'Remodeled year'}, {'factValue': 'Other', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Detached Garage', 'factLabel': 'Parking'}, {'factValue': '6,599 sqft', 'factLabel': 'lotsize'}, {'factValue': '$156/sqft', 'factLabel': 'Price/sqft'}]}yesHouston[{'rating': ['2/10', '3/10', '3/10'], 'data': {'Distance': ['0.65mi', '1.15mi', '0.19mi'], 'Grades': ['9-12', 'PK-5', '6-8']}, 'name': ['Northbrook High School', 'Terrace Elementary School', 'Northbrook Middle School']}]1,792 sqft770804 BedsTX2.0NaNNaN74136719$280,000
377176NaNNaNSingle Family3263 Wolcott Pl2.0{'atAGlanceFacts': [{'factValue': '1962', 'factLabel': 'Year built'}, {'factValue': '1967', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '1 space', 'factLabel': 'Parking'}, {'factValue': '7,704 sqft', 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}YesOrlando[{'rating': ['3/10', '1/10', '3/10'], 'data': {'Distance': ['1.5 mi', '1.3 mi', '1.1 mi'], 'Grades': ['PK-5', '6-8', '9-12']}, 'name': ['Washington Shores Elementary School', 'Carver Middle School', 'Jones High School']}]1,829 sqft328053FL1NaNNaNNaN$171,306
377177ActiveNaNSingle Detached, Traditional2805 S Jennings AveNaN{'atAGlanceFacts': [{'factValue': '1921', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': 'Central A/C (Electric), Central Heat (Electric)', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '7,500 Sq. Ft.', 'factLabel': 'lotsize'}, {'factValue': '$105 / Sq. Ft.', 'factLabel': 'Price/sqft'}]}NaNFort Worth[{'rating': ['4', '6', '5'], 'data': {'Distance': ['0.5 mi', '2.0 mi', '1.3 mi'], 'Grades': ['Preschool to 5', '6 to 8', '9 to 12']}, 'name': ['Daggett Elementary School', 'Rosemont Middle School', 'Paschal High School']}]1,89576110NaNTXNaNNaNNaN14087883199,900
377178NaNNaNSingle FamilyBuildable plan: The Torino (384L) Riverstone Ranch - Premier2{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': 'No Data', 'factLabel': 'Heating'}, {'factValue': 'No Data', 'factLabel': 'Cooling'}, {'factValue': '2 spaces', 'factLabel': 'Parking'}, {'factValue': 'No Data', 'factLabel': 'lotsize'}, {'factValue': '$137', 'factLabel': 'Price/sqft'}]}NaNHouston[{'rating': ['7/10', '6/10', '5/10'], 'data': {'Distance': ['0.3 mi', '2.5 mi', '2.5 mi'], 'Grades': ['PK-4', '7-8', '9-12']}, 'name': ['South Belt Elementary School', 'Thompson Intermediate School', 'Dobie High School']}]1,841770894TX1.0NaNNaNNaN$252,990
377179For saleNaNCondo2238 11th St NW APT 23{'atAGlanceFacts': [{'factValue': '2010', 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': 'Forced air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '1 space', 'factLabel': 'Parking'}, {'factValue': None, 'factLabel': 'lotsize'}, {'factValue': '$564', 'factLabel': 'Price/sqft'}]}NaNWashington[{'rating': ['3/10', '3/10'], 'data': {'Distance': ['0.4 mi', '0.1 mi'], 'Grades': ['PK-5', '6-12']}, 'name': ['Garrison Elementary School', 'Cardozo Education Campus']}]1,417200012DC3.0NaNNaNDCDC444306$799,000
377180NaNNaNSingle Family20800 NE 23rd Ave6.0{'atAGlanceFacts': [{'factValue': '1990', 'factLabel': 'Year built'}, {'factValue': '1990', 'factLabel': 'Remodeled year'}, {'factValue': 'Other', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '2 spaces', 'factLabel': 'Parking'}, {'factValue': '8,500 sqft', 'factLabel': 'lotsize'}, {'factValue': '$311', 'factLabel': 'Price/sqft'}]}NaNMiami[{'rating': ['10/10', '5/10'], 'data': {'Distance': ['32.1 mi', '1.1 mi'], 'Grades': ['PK-8', '9-12']}, 'name': ['Air Base Elementary School', 'Dr Michael M. Krop Senior High School']}]4,017331805FL0.0NaNYesA10702700$1,249,000
377181for saleNaNcondo3530 N Lake Shore Dr #4B3 Baths{'atAGlanceFacts': [{'factValue': '1924', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Radiant', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': 'None', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$337/sqft', 'factLabel': 'Price/sqft'}]}NaNChicago[{'rating': ['1/10', '5/10', '7/10'], 'data': {'Distance': ['10.61mi', '1.42mi', '0.4mi'], 'Grades': ['9-12', '9-12', 'PK-8']}, 'name': ['Hope College Prep High School', 'Lake View High School', 'Nettelhorst Elementary School']}]2,000 sqft606573 BedsIL9.0NaNNaN10374233$674,999
377182for saleNaNsingle-family home15509 Linden Blvd3 Baths{'atAGlanceFacts': [{'factValue': '1950', 'factLabel': 'Year built'}, {'factValue': '1950', 'factLabel': 'Remodeled year'}, {'factValue': 'Other', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '2', 'factLabel': 'Parking'}, {'factValue': '1,600 sqft', 'factLabel': 'lotsize'}, {'factValue': '$458/sqft', 'factLabel': 'Price/sqft'}]}NaNJamaica[{'rating': ['5/10', '4/10'], 'data': {'Distance': ['0.48mi', '0.73mi'], 'Grades': ['PK-5', '6-8']}, 'name': ['Ps 48 William Wordsworth', 'Jhs 8 Richard S Grossley']}]1,152 sqft114343 BedsNY2NaNNaNNaN$528,000
377183NaNNaNNaN7810 Pereida StNaN{'atAGlanceFacts': [{'factValue': None, 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': None, 'factLabel': 'Heating'}, {'factValue': None, 'factLabel': 'Cooling'}, {'factValue': None, 'factLabel': 'Parking'}, {'factValue': None, 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}NaNHouston[{'rating': ['NA', 'NA', 'NA'], 'data': {'Distance': ['1.3 mi', '0.5 mi', '1.9 mi'], 'Grades': ['PK-5', '6-8', '9-12']}, 'name': ['Hiliard El', 'Forest Brook Middle', 'North Forest High School']}]NaN770288,479 sqftTXNaNNaNNaNNaN$34,500
377184NaNNaNSingle Family5983 Midcrown Dr2.0{'atAGlanceFacts': [{'factValue': '2019', 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': 'Electric', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'No Data', 'factLabel': 'Parking'}, {'factValue': '6,969 sqft', 'factLabel': 'lotsize'}, {'factValue': '$140', 'factLabel': 'Price/sqft'}]}Not ApplicableSan Antonio[{'rating': ['5/10', '4/10', '3/10'], 'data': {'Distance': ['0.3 mi', '1.1 mi', '4.1 mi'], 'Grades': ['PK-5', '6-8', '9-12']}, 'name': ['Mary Lou Hartman', 'Woodlake Hills Middle School', 'Judson High School']}]1,462782183TX1.0NaNNaN1403619$204,900

Duplicate rows

Most frequently occurring

statusprivate poolpropertyTypestreetbathshomeFactsfireplacecityschoolssqftzipcodebedsstatestoriesmls-idPrivatePoolMlsIdtarget# duplicates
46for saleNaNtownhouseThe Lockland 27-33 Plan in Landen Pine3.5 Baths{'atAGlanceFacts': [{'factValue': '', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$299/sqft', 'factLabel': 'Price/sqft'}]}NaNAtlanta[{'rating': ['6/10', '6/10', '6/10'], 'data': {'Distance': ['3.85mi', '0.65mi', '1.88mi'], 'Grades': ['10-12', 'PK-5', '6-8']}, 'name': ['North Atlanta High School', 'Smith Elementary School', 'Sutton Middle School']}]2,806 sqft303054 BedsGANaNNaNNaNNaN$839,900+3
0For saleNaNSingle Family11207 NE 127th Ave2.0{'atAGlanceFacts': [{'factValue': '2015', 'factLabel': 'Year built'}, {'factValue': '2015', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '2 spaces', 'factLabel': 'Parking'}, {'factValue': '5,662 sqft', 'factLabel': 'lotsize'}, {'factValue': '$246', 'factLabel': 'Price/sqft'}]}NaNVancouver[{'rating': ['4/10', '4/10', '6/10'], 'data': {'Distance': ['2.1 mi', '2.1 mi', '0.8 mi'], 'Grades': ['PK-4', '5-8', '9-12']}, 'name': ['Glenwood Heights Primary School', 'Laurin Middle School', 'Prairie High School']}]1,670986823WA1NaNNaN19047778$410,0002
1New constructionNaNMulti Family335 H St NENaN{'atAGlanceFacts': [{'factValue': '2017', 'factLabel': 'Year built'}, {'factValue': None, 'factLabel': 'Remodeled year'}, {'factValue': 'Forced air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'No Data', 'factLabel': 'Parking'}, {'factValue': None, 'factLabel': 'lotsize'}, {'factValue': '$651', 'factLabel': 'Price/sqft'}]}NaNWashington[{'rating': ['8/10', '6/10', '4/10'], 'data': {'Distance': ['0.2 mi', '0.2 mi', '1.5 mi'], 'Grades': ['PK-5', '6-8', '9-12']}, 'name': ['Ludlow-Taylor Elementary School', 'Stuart-Hobson Middle School', 'Eastern High School']}]3,600 sqft200020DC4NaNNaN1000123741$2,345,0002
2for saleNaNapartment269 W 87th St #B6 Baths{'atAGlanceFacts': [{'factValue': '2018', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$2,447/sqft', 'factLabel': 'Price/sqft'}]}NaNNew York[{'rating': ['8/10'], 'data': {'Distance': ['0.23mi'], 'Grades': ['K-5']}, 'name': ['Ps 166 The Richard Rogers School Of The Arts And S']}]3,882 sqft100245 BedsNYNaNNaNNaN1245762$9,500,0002
3for saleNaNcondo1517 Briarcliff Rd NE #C3 Baths{'atAGlanceFacts': [{'factValue': '2018', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Attached Garage', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$368/sqft', 'factLabel': 'Price/sqft'}]}NaNAtlanta[{'rating': ['5/10', '5/10', '6/10'], 'data': {'Distance': ['1.02mi', '1.11mi', '3.91mi'], 'Grades': ['PK-5', '9-12', '6-8']}, 'name': ['Briar Vista Elementary School', 'Druid Hills High School', 'Druid Hills Middle School']}]2,386 sqft303062 BedsGANaNNaNNaN6126572$877,8602
4for saleNaNcondo184 Kent Ave #A314NaN{'atAGlanceFacts': [{'factValue': '1915', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$1,369/sqft', 'factLabel': 'Price/sqft'}]}NaNBrooklyn[{'rating': ['6/10'], 'data': {'Distance': ['0.5mi'], 'Grades': ['PK-5']}, 'name': ['Ps 17 Henry D Woodworth']}]712 sqft11249NaNNYNaNNaNNaN1123517$975,0002
5for saleNaNcondo184 Kent Ave #B3012 Baths{'atAGlanceFacts': [{'factValue': '1915', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': '$1,855/sqft', 'factLabel': 'Price/sqft'}]}NaNBrooklyn[{'rating': ['6/10'], 'data': {'Distance': ['0.47mi'], 'Grades': ['PK-5']}, 'name': ['Ps 17 Henry D Woodworth']}]1,078 sqft112492 BedsNYNaNNaNNaN1062169$2,000,0002
6for saleNaNcondo519 Circle St #A4 Baths{'atAGlanceFacts': [{'factValue': '2007', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': 'Forced Air', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': 'Attached Garage', 'factLabel': 'Parking'}, {'factValue': '9,714 sqft', 'factLabel': 'lotsize'}, {'factValue': '$192/sqft', 'factLabel': 'Price/sqft'}]}yesAlamo Heights[{'rating': ['7/10', '7/10', '6/10'], 'data': {'Distance': ['0.75mi', '1.45mi', '0.12mi'], 'Grades': ['9-12', '6-8', '1-5']}, 'name': ['Alamo Heights High School', 'Alamo Heights J High School', 'Cambridge Elementary School']}]3,385 sqft782094 BedsTXNaNNaNNaN1363375$650,0002
7for saleNaNcoop152 W 58th St #12 Baths{'atAGlanceFacts': [{'factValue': '1916', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': 'Central', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '', 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}yesNew York[{'rating': ['9/10', '8/10', '3/10', '8/10', '9/10', '5/10'], 'data': {'Distance': ['2.02mi', '2.02mi', '2.1mi', '2.16mi', '4.52mi', '0.58mi'], 'Grades': ['6-8', '9-12', '6-12', '6-11', '6-8', 'PK-5']}, 'name': ['Nyc Lab Ms For Collaborative Studies', 'Nyc Lab High School For Collaborative Studies', 'Life Sciences Secondary School', 'Ms 260 Clinton School Writers And Artists', 'Lower Manhattan Community Middle School', 'Ps 111 Adolph S Ochs']}]NaN100193 BedsNYNaNNaNNaN1230018$2,500,0002
8for saleNaNlot/land116 Country Club RdNaN{'atAGlanceFacts': [{'factValue': '', 'factLabel': 'Year built'}, {'factValue': '', 'factLabel': 'Remodeled year'}, {'factValue': '', 'factLabel': 'Heating'}, {'factValue': '', 'factLabel': 'Cooling'}, {'factValue': '', 'factLabel': 'Parking'}, {'factValue': '7840 sqft', 'factLabel': 'lotsize'}, {'factValue': None, 'factLabel': 'Price/sqft'}]}NaNAsheville[{'rating': ['8/10', '5/10', '5/10', '7/10', '4/10', '6/10', '4/10', '6/10'], 'data': {'Distance': ['3.34mi', '1.98mi', '3.34mi', '3.51mi', '0.95mi', '0.63mi', '2.46mi', '4.23mi'], 'Grades': ['9-12', 'PK-5', '9-12', 'PK-5', 'K-5', 'PK-5', '6-8', 'K-5']}, 'name': ['School Of Inquiry And Life Science', 'Isaac Dickson Elementary', 'Asheville High', 'Hall Fletcher Elementary', 'Claxton Elementary', 'Ira B Jones Elementary', 'Asheville Middle', 'Vance Elementary']}]NaN28804NaNNCNaNNaNNaN3462298$169,9002